Self-organizing Maps in Natural Language Processing

نویسنده

  • Timo Honkela
چکیده

Kohonen's Self-Organizing Map (SOM) is one of the most popular arti cial neural network algorithms. Word category maps are SOMs that have been organized according to word similarities, measured by the similarity of the short contexts of the words. Conceptually interrelated words tend to fall into the same or neighboring map nodes. Nodes may thus be viewed as word categories. Although no a priori information about classes is given, during the self-organizing process a model of the word classes emerges. The central topic of the thesis is the use of the SOM in natural language processing. The approach based on the word category maps is compared with the methods that are widely used in arti cial intelligence research. Modeling gradience, conceptual change, and subjectivity of natural language interpretation are considered. The main application area is information retrieval and textual data mining for which a speci c SOM-based method called the WEBSOM has been developed. The WEBSOM method organizes a document collection on a map display that provides an overview of the collection and facilitates interactive browsing. 1

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning to Understand - General Aspects of Using Self-Organizing Maps in Natural Language Processing

The Self-Organizing Map (SOM) is an artificial neural network model based on unsupervised learning. In this paper, the use of the SOM in natural language processing is considered. The main emphasis is on natural features of natural language including contextuality of interpretation, and the communicative and social aspects of natural language learning and usage. The SOM is introduced as a gener...

متن کامل

Self-Organizing Maps of Words for Natural Language Processing Applications

Kohonen’s Self-Organizing Map (SOM) is a general unsupervised tool for ordering highdimensional statistical data so that neighboring nodes on the map represent similar inputs. Often the SOM is applied to numerical data in application areas like pattern recognition, signal processing, and multivariate statistical analysis. The SOM can also be used to find statistical similarities between symbols...

متن کامل

Indexing Audio Documents by using Latent Semantic Analysis and SOM

This paper describes an important application for state-of-art automatic speech recognition , natural language processing and information retrieval systems. Methods for enhancing the indexing of spoken documents by using latent semantic analysis and self-organizing maps are presented, motivated and tested. The idea is to extract extra information from the structure of the document collection an...

متن کامل

Creating an Order in Distributed Digital Libraries by Integrating Independent Self-Organizing Maps

Digital document libraries are an almost perfect application arena for un-supervised neural networks. This because many of the operations computers have to perform on text documents are classiication tasks based on \noisy" input patterns. The \noise" arises because of the known inaccuracy of mapping natural language to an indexing vocabulary representing the contents of the documents. A growing...

متن کامل

A parallel growing architecture for self-organizing maps with unsupervised learning

Self-Organizing maps (SOM) have become popular for tasks in data visualization, pattern classification or natural language processing and can be seen as one of the major concepts for artificial neural networks of today. Their general idea is to approximate a high dimensional and previously unknown input distribution by a lower dimensional neural network structure with the goal to model the topo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997